Reinforcement Learning with Internal Reward for Multi-Agent Cooperation: A Theoretical Approach

Authors

Fumito Uwano

Naoki Tatebe

Masaya Nakata

Keiki Takadama

Tim Kovacs

Abstract

This paper focuses on multi-agent cooperation, which is generally difficult to achieve without sufficient information about other agents, and proposes a reinforcement learning method that introduces an internal reward to achieve cooperation without such information. To guarantee cooperation, the paper theoretically derives the condition for selecting appropriate actions by changing the internal rewards given to the agents, and extends two reinforcement learning methods (Q-learning and Profit Sharing) so that the agents acquire appropriate Q-values updated according to the derived condition. Concretely, the internal rewards change only when the agents can find a better solution than the current one. Intensive simulations on maze problems, used as one testbed, revealed the following: (1) the proposed method enables the agents to select their own appropriate cooperative actions, which contribute to reaching their goals in the minimum number of steps, whereas the conventional methods (i.e., Q-learning and Profit Sharing) cannot always attain the minimum number of steps; and (2) the proposed method based on Profit Sharing performs as well as the proposed method based on Q-learning.
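As a rough illustration of the idea in the abstract, the sketch below shows a tabular Q-learning update in which the learning signal is the environment reward plus an agent-specific internal reward. It does not reproduce the condition derived in the paper, which the abstract does not state. The names `q_update` and `internal_reward`, the state labels, and the values of `alpha` and `gamma` are all assumptions made for illustration.

```python
from collections import defaultdict


def q_update(q, state, action, next_state, actions, env_reward,
             internal_reward=0.0, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step on env_reward + internal_reward.

    `internal_reward` is a hypothetical per-agent bonus; how it should be
    set to guarantee cooperation is exactly what the paper derives and is
    NOT implemented here.
    """
    reward = env_reward + internal_reward
    # Greedy bootstrap value of the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return q[(state, action)]


actions = ["up", "down", "left", "right"]

# Same transition, with and without an internal reward (assumed values).
q_plain = defaultdict(float)
plain = q_update(q_plain, "s0", "right", "goal", actions, env_reward=1.0)

q_boosted = defaultdict(float)
boosted = q_update(q_boosted, "s0", "right", "goal", actions,
                   env_reward=1.0, internal_reward=0.5)
```

With all Q-values starting at zero, the update with an internal reward moves the Q-value for the same state-action pair further than the plain update, which is how an internal reward can bias an agent toward a particular goal or path.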

Similar resources

Theoretical considerations of potential-based reward shaping for multi-agent systems

Potential-based reward shaping has previously been proven to both be equivalent to Q-table initialisation and guarantee policy invariance in single-agent reinforcement learning. The method has since been used in multi-agent reinforcement learning without consideration of whether the theoretical equivalence and guarantees hold. This paper extends the existing proofs to similar results in multi-a...
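Potential-based shaping adds the term F(s, s') = γΦ(s') − Φ(s) to the environment reward. The following is a minimal sketch of that term. The potential function `phi` (negative Manhattan distance to a goal cell), the `GOAL` coordinates, and the discount value are assumptions chosen only for illustration.

```python
GOAL = (3, 3)  # hypothetical goal cell in a grid maze


def phi(state):
    """Assumed potential: negative Manhattan distance to GOAL."""
    return -(abs(state[0] - GOAL[0]) + abs(state[1] - GOAL[1]))


def shaping_reward(potential, state, next_state, gamma=0.9):
    """Potential-based shaping term F(s, s') = gamma * phi(s') - phi(s)."""
    return gamma * potential(next_state) - potential(state)


# Moving one step closer to the goal yields a positive shaping term,
# moving one step away yields a negative one.
toward = shaping_reward(phi, (0, 0), (0, 1))
away = shaping_reward(phi, (0, 1), (0, 0))
```

Because the term is a difference of potentials, with `gamma=1.0` the shaping rewards collected around any closed loop of states sum to zero, so an agent cannot gain from the shaping term by cycling.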

Autonomous Learning of Reward Distribution in Not100 Game

In this paper, autonomous learning of reward distribution in multi-agent reinforcement learning was applied to the four-player game named “not100”. In this game, each agent needs shrewder tactics for cooperating with the other agents than in the tasks to which the learning was previously applied. The reward distribution ratio after learning varied among simulation runs. However, the v...

Plan-based reward shaping for multi-agent reinforcement learning

Recent theoretical results have justified the use of potential-based reward shaping as a way to improve the performance of multi-agent reinforcement learning (MARL). However, the question remains of how to generate a useful potential function. Previous research demonstrated the use of STRIPS operator knowledge to automatically generate a potential function for single-agent reinforcement learnin...

Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action estimating system dynamics in terms of both the natural reward for task achievement and the virtual reward for cooperation. The virtual reward for cooperation is ascertained dynamically by a coordinating agent who est...

Autonomous Learning of Reward Distribution for Each Agent in Multi-Agent Reinforcement Learning

A novel approach to reward distribution in multi-agent reinforcement learning is proposed. The agent that gets a reward gives a part of it to the other agents. If an agent gives a part of its own reward to the others, they may help the agent to get more reward. There may be some cases in which the agent gets more reward than it gave to the others. In this case, it is better for...


Publication date: 2015
